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DESCRIPTION 

Novel N-Acetylglucosaminyltransferase, Nucleic Acid Coding for the Same, and 
Uses Thereof for Diagnosis of Cancer or Tumor 

Technical Field 

5 The present invention relates to a novel enzyme having an activity to transfer 

Af-acetylglucosamine to a non-reducing terminal of Galpl-4Glc or Galpl-4GIcNAc 
group through pi, 3 -linkage, and to a nucleic acid coding for the same, as well as to 
nucleic acids for measuring the nucleic acid. The present invention further relates 
to diagnosis of cancer or tumor using the expression amount of the above-mentioned 
1 0 enzyme or the gene thereof as an index. 

Background Art 

Five types of enzymes are known, having an activity to transfer 7V- 
acetylglucosamine to a non-reducing terminal of Gaipi-4Glc or Gaipi-4GlcNAc 
group through pi, 3 -linkage, which activity is involved in the synthesis of 

15 polylactosamine sugar chains (Togayachi, A. et al., J Biol Chem, 2001, 276, 22032- 
40; Shiraishi, N. et al., J Biol Chem, 2001, 276, 3498-507; Sasaki, K et al., Proc Natl 
Acad Sci USA, 1997, 94, 14294-9). However, although the amount of 
polylactosamine on cell surfaces is increased by making the cells express the gene of 
the enzyme, some of the enzymes expressed have very low activities. Thus, 

2 0 although it is thought that the enzymes which produce polylactosamine have different 
characteristics, the characterization of the enzymes has not been sufficient. 
Therefore, to prepare or produce the polylactosamine sugar chain structure which 
requires the enzyme activity, it is necessary to chemically synthesize the structure, 
isolating the structure from a biological component or to synthesize the structure 

2 5 enzymatically using a tissue homogenate. 

It is known that sugar chain structures such as Lewis antigen exist on the 
sugar chain structures based on polylactosamine sugar chains (Kannagi R. Glycoconj 



2 



J. 1997 Aug;14(5):577-84. Review; Nishihara S et al., J Biol Chem. 1994 Nov 
1 8;269(46):2927 1 -8). Similarly, it is said that the structures such as the lengths of 
polylactosamine sugar chains are involved in cellular immunity by NK cells or the 
like (Ohyama C et a., EMBO J. 1999 Mar 15;18(6):1516-25). Similarly, it is known 
5 that human stomach tissue is infected with Helicobacter pylori through a related 
sugar chain such as Lewis antigen (Wang G et al., Mol Microbiol. 2000 
Jun;36(6):l 187-96. Review; Falk PG et al., Proc Natl Acad Sci USA. 1995 Feb 
28 ;92(5) : 1 5 1 5-9). Thus, if the gene of an enzyme having an activity to transfer N- 
acetylglucosamine to a non-reducing terminal of Gaipi-4Glc or Gaipi-4GlcNAc 

10 group through pi,3-linkage can be cloned, and if the enzyme can be produced by a 
genetic engineering process using the gene, an antibody to the enzyme may also be 
produced. Therefore, these are useful for the diagnoses, therapies and prophylactics 
of cancers, immune diseases and infectious diseases by pylori. However, the 
enzyme has not yet been purified or isolated, and there is no clue to the isolation of 

1 5 the enzyme and identification of the gene. As a result, an antibody to the enzyme 
has not been prepared. 

Disclosure of the Invention 
Accordingly, an object of the present invention is to provide an enzyme 
having an activity to transfer JV-acetylglucosamine to a non-reducing terminal of 

2 0 Galp 1 -4Glc or Galp 1 -4GlcNAc group through p 1 ,3-linkage, and a nucleic acid 
coding for the same. Another object of the present invention is to provide a 
recombinant vector which expresses the above-mentioned the nucleic acid in a host 
cell, to provide a cell in which the nucleic acid is introduced and which expresses the 
nucleic acid and the enzyme protein, and to provide the enzyme protein. Still 

2 5 another object of the present invention is to provide a nucleic acid for measurement 
of the above-mentioned nucleic acid according to the present invention, and to 
provide a method for producing the enzyme having the activity. 
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As mentioned above, since the enzyme of interest has not been isolated, it is 
impossible to know its partial amino acid sequence. In general, it is not easy to 
isolate and purify a protein contained in cells in a trace amount, and so isolation of 
the enzyme from cells, which has not been isolated so far, is expected not easy. The 
5 present inventors thought that if there is a homologous region among the nucleotide 
sequences of the various enzyme genes, which enzymes have relatively similar 
actions to that of the enzyme of interest, the gene of the enzyme of interest may also 
have the homologous sequence. After searching the nucleotide sequences of the 
known pl,3-N-acetylglucosaminyltransferase genes, pl,3-galactoslytransferase genes 

1 0 and p 1 ,3 -N-acetylgalactosaminy transferase genes, a homologous region was 

discovered. Thus, based on the cloning by PCR using cDNA library, in which a 
primer was set in the homologous region, and after various considerations, the 
present inventors succeeded in the cloning of the gene of the enzyme, and its 
nucleotide sequence and the deduced amino acid sequence were determined, thereby 

1 5 accomplishing the present invention. 

That is, the present invention provides a protein having the amino acid 
sequence shown in SEQ ID NO: 1 in SEQUENCE LISTING, or a protein having the 
same amino acid sequence as shown in SEQ ID NO.T except that one or more amino 
acids are substituted or deleted, or that one or more amino acids are inserted or added, 

2 0 which has an activity to transfer N-acetylglucosamine to a non-reducing terminal of 
Galp 1 -4Glc or Galp 1 -4GlcNAc group through p 1 ,3 -linkage. The present invention 
also provides a nucleic acid coding for the protein. The present invention further 
provides a recombinant vector containing the nucleic acid, which can express the 
nucleic acid in a host cell. The present invention still further provides a cell which 

2 5 is transformed by the recombinant vector, which expresses the nucleic acid. The 

present invention still further provides a nucleic acid for measurement of the nucleic 
acid, which specifically hybridizes with the nucleic acid. The present invention still 



further provides use of the nucleic acid for measurement for the diagnosis of a cancer 
or tumor. The present invention still further provides a method for diagnosis of a 
cancer or tumor, comprising determining the amount of the above-mentioned enzyme 
or determining the expression amount of the gene coding for the enzyme, in (a) 
sample cell(s) separated from body. The present invention still further provides a 
method for measuring the above-mentioned nucleic acid according to the present 
invention, comprising annealing the nucleic acid for measurement of nucleic acid, 
according to the present invention, and the above-described nucleic acid according to 
the present invention so as to hybridize them, and measuring the hybridized nucleic 
acid. The present invention still further provides use of the nucleic acid for 
measurement of nucleic acid, according to the present invention, for the production 
of nucleic acid for measurement of nucleic acid according to the present invention. 
The present invention still further provides use of the nucleic acid for measurement 
of nucleic acid, according to the present invention, for the production of diagnostic 
reagent for a cancer and/or tumor. 

By the present invention, an enzyme having an activity to transfer N- 
acetylglucosamine to a non-reducing terminal of Gaipi-4Glc or Gaipi-4GlcNAc 
group through pi,3-linkage, and a nucleic acid encoding the enzyme were first 
provided. Further, by the present invention, a nucleic acid for measuring the above- 
mentioned nucleic acid was first provided. Still further, a simple and accurate 
method for diagnosis of a cancer or tumor, especially a cancer or tumor of digestive 
organs, and a nucleic acid for measurement used therefor were first provided. Thus, 
it is expected that the present invention will greatly contribute to the diagnoses of 
cancers and tumors of digestive organs. 

Brief Description of the Drawings 

Fig. 1 shows the results of the flow cytometry showing the binding property 
between the HCT15 colon cancer cell line and the LEA lectin, the cell line being 
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transformed with a recombinant vector into which the gene of the present invention 
was incorporated or with a recombinant vector into which the gene of the present 
invention was not incorporated. 

Fig. 2 shows the results of the flow cytometry showing the binding property 
5 between the LSC colon cancer cell line and the LEA lectin, the cell line being 

transformed with a recombinant vector into which the gene of the present invention 
was incorporated or with a recombinant vector into which the gene of the present 
invention was not incorporated. 

Fig. 3 shows the results of the flow cytometry showing the binding property 
10 between the HCT15 colon cancer cell line and the WGA lectin, the cell line being 
transformed with a recombinant vector into which the gene of the present invention 
was incorporated or with a recombinant vector into which the gene of the present 
invention was not incorporated. 

Fig. 4 shows comparison of the amount of expression of the gene according to 
1 5 the present invention in normal tissues and that in cancer tissues of colon cancer 
patients. 

Best Mode for Carrying out the Invention 
The nucleic acid resulting from the removal of the initiation codon (ATG) 
from the nucleic acid encoding the protein of the present invention, which was 

2 0 cloned from a human antrum cDNA library by the method that will be described in 
detail in the Examples below, has the nucleotide sequence shown in SEQ ID NO: 4 
in the SEQUENCE LISTING, and the deduced amino acid sequence encoded thereby 
is described below the nucleotide sequence. In SEQ ID NO:3, the amino acid 
sequence alone is shown. In the Examples below, the nucleic acid having the 

25 nucleotide sequence shown in SEQ ID NO:4 was incorporated into an expression 

vector, expressed in insect cells and it was confirmed that a protein having the above- 
mentioned enzyme activity was produced. By comparing the amino acid sequence 



shown in SEQ ID NO: 3 and the amino acid sequence of a similar enzyme (concrete 
enzyme name: p3GnT2 : AB049584 which is the gene of p-l,3-Af- 
acetylglucosaminyltransferase), it is thought that the region with a relatively high 
homology, that is, the region from the 45th amino acid to the C-terminal of the amino 
acid sequence shown in SEQ ID NO: 3 is the active domain of the enzyme, and that 
the above-mentioned enzyme activity is exhibited if this region consisting of 283 
amino acids is contained. This 283 amino acids is shown in SEQ ID NO:l and the 
nucleic acid encoding this, taken out from SEQ ID NO:4, is shown in SEQ ID NO:2. 

The protein (named "(53GnT-7 n ) according to the present invention obtained 
in the Examples below is an enzyme having the following characteristics. Each of 
the characteristics as well as the methods for measuring them are described in detail 
in the Examples below. 

Action: Transfer Af-acetylglucosamine to a non-reducing terminal of Galp 1 - 

4Glc group or Galp 1 -4GlcNAc group through p 1 ,3-linkage. The reaction catalyzed 
by the enzyme, expressed in terms of reaction equation, is as follows: UDP-JV- 
acetyl-D-glucosamine + p-D-galactosyl-l,4-D-glucosyl-R — > UDP + //-acetyl- P-D- 
glucosaminyl- 1 ,3-p-D-galactosyl- 1 ,4-D-glucosyl-R, or UDP-N-acetyl-D-glucosamine 
+ P-D-galactosyl-l,4-N-acetyl-D-glucosaminyl-R — UDP + 7/-acetyl-p-D- 
glucosaminyl-l,3-p-D-galactosyl-l,4-A^-acetyl-D-glucosaminyl-R 
Substrate Specificity: Galp 1 -4Glc group or Galp 1 -4GlcNAc group. In 

biological substances, these groups occurs abundantly as, for example, 
polylactosamine structures in glycoproteins (O-glycans and N-glycans) and 
glycolipids (lacto neolacto series sugar chains and the like). Further, the Gaipi- 
4Glc groups or Gaipi-4GlcNAc groups contained in the basal structures of 
proteoglycans (keratan sulfate) and the like. 

In general, it is well-known in the art that there are cases wherein the 
physiological activity of a physiologically active protein such as an enzyme is 



retained even if the amino acid sequence of the protein is modified such that one or 
more amino acids in the amino acid sequence is substituted or deleted, or one or 
more amino acids are inserted or added to the amino acid sequence. Therefore, a 
protein having the same amino acid sequence as shown in SEQ ID NO:l or 3 except 
that one or more amino acids are substituted or deleted, or one or more amino acids 
are inserted or added, which protein has an activity to transfer N-acetylglucosamine 
to a non-reducing group of Galpl-4Glc or Gaipi-4GlcNAc group through pi,3- 
linkage (the protein is hereinafter referred to as "modified protein" for convenience) 
is also within the scope of the present invention. The amino acid sequence of such a 
modified protein preferably has a homology of not less than 70%, preferably not less 
than 90%, still more preferably not less than 95% to the amino acid sequence shown 
in SEQ ID NO: 1 or 3. The homology of the nucleotide sequence may easily be 
calculated by using a well-known software such as FASTA, and such a software is 
available on the internet. Further, as the modified protein, one having the same 
amino acid sequence as shown in SEQ ID NO:l or 3 except that one or several amino 
acids are substituted or deleted, or that one or several amino acids are inserted or 
added is especially preferred. Further, a protein containing the protein having the 
amino acid sequence shown in SEQ ID NO:l or 3, or a modified protein thereof, 
which has an activity to transfer N-acetylglucosamine to a non-reducing terminal of 
Gaipi-4Glc or Galpl-4GlcNAc group through pi,3-linkage is also within the scope 
of the present invention. For example, in the Examples below, a nucleic acid 
encoding a membrane-bound type enzyme, in which a transmembrane region is 
ligated to the upstream of the amino acid sequence shown in SEQ ID NO:3 was also 
cloned, and such a membrane-bound type enzyme is also within the scope of the 
present invention. 

The present invention also provides nucleic acids coding for the amino acid 
sequence shown in SEQ ID NO: 1 or 3 and nucleic acids coding for the amino acid 
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sequences of the above-mentioned modified proteins. As the nucleic acid, DNA is 
preferred. As is well-known, due to degeneracy, there may be a plurality of codons 
each of which codes for the same single amino acid. However, as long as a nucleic 
acid codes for the above-described amino acid sequence, any nucleic acid having any 
nucleotide sequence is within the scope of the present invention. The nucleotide 
sequences of the cDNA actually cloned in the Examples below are shown in SEQ ID 
NOs:2 and 4. Those nucleic acids which hybridize with the nucleic acid having the 
nucleotide sequence shown in SEQ ID NO:2 or 4 under stringent conditions (i.e., 
hybridization is performed at 50 to 65°C using a common hybridization solution such 
as 5 x Denhardt's reagent, 6 x SSC, 0.5% SDS or 0.1% SDS), and which code for the 
above-described modified proteins are within the scope of the present invention. 

The above-described nucleic acid according to the present invention can be 
prepared by the method described in detail in Example below. Alternatively, since 
the nucleotide sequence was clarified by the present invention, it can easily be 
prepared by using human antrum as the material and performing the well-known RT- 
PCR method. The above-described protein according to the present invention can 
also be easily prepared by, for example, incorporating the above-described nucleic 
acid according to the present invention into an expression vector, expressing the 
nucleic acid in a host cell, and purifying the produced protein. 

By inserting the above-described nucleic acid according to the present 
invention into a cloning site of an expression vector, a recombinant vector which can 
express the above-described nucleic acid in a host cell may be obtained. As the 
expression vector, various plasmid vectors and virus vectors for various host cells are 
well-known and commercially available. In the present invention, such a 
commercially available expression vector may preferably be employed. The 
methods for transforming or transducing host cells with such a recombinant vector 
are also well-known. The present invention also provides a cell into which the 



nucleic acid according to the present invention is introduced by transformation, 
transduction or transfection, which expresses the nucleic acid. The methods per se 
for introducing a foreign gene into a host cell are well-known, and the introduction of 
the foreign gene may easily be attained by, for example, using the above-mentioned 
recombinant vector. An example of the construction of a recombinant vector and a 
method for introducing the nucleic acid according to the present invention into host 
cells using the recombinant vector are described in detail in the Examples below. 

Sugar chains may be bound to the protein according to the present invention, 
as long as the protein has the amino acid sequence described above and has the 
above-described enzyme activity. In other words, the term "protein" used herein 
also includes "glycoprotein". 

Since the nucleotide sequence of the cDNA of the novel enzyme according to 
the present invention was clarified by the present invention, nucleic acids for 
measurement according to the present invention (hereinafter referred to as simply 
"nucleic acid for measurement"), which specifically hybridize with the mRNA or the 
cDNA of the enzyme, were provided by the present invention. The term 
"specifically" herein means that the nucleic acid does not hybridize with other nucleic 
acids existing in the cells subjected to the test and hybridizes only with the above- 
described nucleic acid according to the present invention. Although it is preferred, 
in general, that the nucleic acid for measurement has a sequence homologous with a 
part of the nucleic acid having the nucleotide sequence shown in SEQ ID NO:2 or 4, 
mismatch of about 1 or 2 bases does not matter in many cases. The nucleic acid for 
measurement may be used as a probe or a primer in a nucleic acid-amplification 
method. To assure specificity, the number of bases in the nucleic acid for 
measurement is preferably not less than 15, more preferably not less than 18. In 
cases where the nucleic acid is used as a probe, the size is preferably not less than 15 
bases, more preferably not less than 20 bases, and not more than the full length of the 
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coding region. In cases where the nucleic acid is used as a primer, the size is 
preferably not less than 15 bases, more preferably not less than 18 bases, and less 
than 50 bases. The methods for measuring a test nucleic acid using a nucleic acid 
having a sequence complementary to a part of the test nucleic acid as a primer of a 
5 gene-amplification method such as PCR or as a probe are well-known, and the 

methods by which the mRNA of the enzyme according to the present invention was 
measured by Northern blot or in situ hybridization are concretely described in detail 
in the Examples below. In the present specification, "measurement" includes 
detection, quantification and semi-quantification. 

1 0 The nucleic acid-amplification methods such as PCR are well-known in the 

art, and reagent kits and apparatuses therefor are commercially available, so that they 
may easily be carried out. That is, for example, a test nucleic acid serving as a 
template (e.g., the cDNA of the gene of the enzyme of the present invention) and a 
pair of nucleic acids for measurement (primers) according to the present invention 

15 are mixed in a buffer in the presence of Taq polymerase and dNTPs, and the steps of 
denaturation, annealing and extension are carried out by changing the temperature of 
the reaction mixture. Usually, the denaturation step is carried out at 90 to 95°C, the 
annealing step is carried out at Tm between the template and the primers or a vicinity 
thereof (preferably within ±4°C) 5 and the extension step is carried out at 72°C which 

20 is the optimum temperature of Taq polymerase. The reaction time of each step is 
selected from about 30 seconds to 2 minutes. By repeating this thermal cycle for 
about 25 to 40 times, the region between the pair of primers is amplified. The 
nucleic acid-amplification method is not restricted to PCR, but other nucleic acid- 
amplification methods well-known in the art may also be employed. By carrying 

2 5 out the nucleic acid-amplification method using a pair of the above-described nucleic 
acids for measurement according to the present invention as primers and using the 
test nucleic acid as a template, the test nucleic acid is amplified. In contrast, in 
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cases where the test nucleic acid is not contained in the sample, the amplification 
does not occur. Therefore, by detecting the amplification product, whether the test 
nucleic acid exists in the sample or not may be determined. Detection of the 
amplification product may be carried out by a method in which the reaction solution 
5 after the amplification is subjected to electrophoresis, and the bands are stained with 
ethidium bromide or the like, or by a method in which the amplification product after 
electrophoresis is immobilized on a solid phase such as a nylon membrane, a labeled 
probe which specifically hybridizes with the test nucleic acid is hybridized with the 
test nucleic acid, and the label after washing is detected. Alternatively, the test 

1 0 nucleic acid in the sample may be quantified by the so called realtime detection PCR 
using a quencher fluorescent pigment and a reporter fluorescent pigment. Since the 
kits for realtime detection PCR are also commercially available, realtime detection 
PCR may also be carried out easily. The test nucleic acid may also be semi- 
quantified based on the intensity of the band resulted in electrophoresis. The test 

1 5 nucleic acid may be a mRNA or a cDNA reverse-transcribed from a mRNA. In 
cases where a mRNA is amplified as the test nucleic acid, NASBA method (3SR 
method, TMA method) using the above-described pair of primers may also be 
employed. NASBA method per se is well-known, and kits therefor are 
commercially available, so that NASBA method may easily be carried out using the 

2 0 above-described pair of primers. 

As the probe, labeled probe obtained by labeling the above-described nucleic 
acid for measurement with a fluorescent label, radioactive label, biotin label or the 
like may be used. The methods per se for labeling a nucleic acid are well-known. 
Whether the test nucleic acid exists in the sample or not may be determined by 

2 5 immobilizing the test nucleic acid or amplification product thereof, hybridizing the 
labeled probe therewith, and measuring the label bound to the solid phase after 
washing. Alternatively, the nucleic acid for measurement is immobilized, the test 



12 



nucleic acid is hybridized therewith, and the test nucleic acid bound to the solid 
phase is detected by a labeled probe or the like. In such a case, the nucleic acid for 
measurement immobilized on the solid phase is also called a probe. The methods 
for measuring a test nucleic acid using a nucleic acid probe are also well-known in 
the art, and may be attained by making contact between the nucleic acid probe and 
the test sample in a buffer at Tm or a vicinity thereof (preferably within ±4°C) so as 
to hybridize them, and then measuring the hybridized labeled probe or the test 
nucleic acid bound to the immobilized probe. Such a method includes well-known 
methods such as Northern blot and in situ hybridization described in the Examples 
below, as well as Southern blot. 

By making the enzyme according to the present invention act on a 
glycoprotein, oligosaccharide or polysaccharide having (a) Gaipi-4Glc or Gaipi- 
4GlcNAc group(s), 7V-acetylglucosamine is bound to the non-reducing terminal(s) of 
the Galp 1 -4Glc or Galp 1 -4GlcNAc group(s) through P 1 ,3-linkage. Thus, the 
enzyme according to the present invention may be used for modification of sugar 
chains of glycoproteins and for synthesis of saccharides. Further, by administering 
this enzyme as an immunogen to an animal, an antibody to this enzyme may be 
prepared, so that the enzyme may be measured by an immunoassay using the 
antibody. Therefore, the enzyme according to the present invention and the nucleic 
acid coding for the enzyme are useful for the preparation of such an immunogen. 
Such an antibody and the above-described nucleic acid for measurement are useful 
for the measurement of the enzyme in the body, and the measurement is useful for 
the diagnoses, therapies and preventions of cancers, immune diseases and infectious 
diseases by pylori. 

The antibody, preferably the monoclonal antibody, which reacts with the 
enzyme of the present invention by antigen-antibody reaction, may be prepared by a 
well-known method comprising administering the enzyme of the present invention as 
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an immunogen to an animal. Such an antibody may be used for the diagnoses of 
cancers or tumors, preferably cancers or tumors of digestive organs, especially cancer 
or tumor of colon, more preferably, for the diagnosis of colon cancer. In cases 
where the antibody is used for the diagnosis of a cancer or tumor, the above- 
5 described enzyme is measured by an immunoassay utilizing the antigen-antibody 
reaction between the enzyme in the sample cells and the antibody, and the result is 
compared with the measurement results obtained for normal cells. If the measured 
amount of the enzyme is smaller than that in the normal cells, especially if the 
enzyme is not detected, it is judged that the possibility that the sample is a cancer or 

1 0 tumor is high. The immunoassays per se are well-known, and any of the well- 
known immunoassays may be employed. That is, classifying the known 
immunoassays according to the reaction type, known immunoassays include 
sandwich immunoassays, competition immunoassays, agglutination immunoassays, 
Western blot and the like. Classifying the known immunoassays according to the 

1 5 label employed, known immunoassays include fluorescence immunoassays, enzyme 
immunoassays, radio immunoassays, biotin immunoassays and the like. Any of 
these immunoassays may be employed. Further, diagnosis may be attained by 
immunohistostaining. In cases where a labeled antibody is used in the 
immunoassay, the methods per se for labeling an antibody are well-known, and any 

20 of the well-known methods may be employed. It is known that by decomposing an 
antibody with papain or pepsin, an antibody fragment such as Fab fragment or F(ab ! )2 
fragment having the binding ability with the corresponding antigen (such a fragment 
is called "antigen-binding fragment" in the present specification) is obtained. The 
antigen-binding fragments of the antibody of the present invention may also be used 

25 in the same manner as the antibody. 

These immunoassays per se are well-known in the art, and so it is not 
necessary to explain these immunoassays in the present specification. Briefly, in 
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sandwich immunoassays, for example, the antibody of the present invention or an 
antigen-binding fragment thereof is immobilized on a solid phase as a first antibody. 
The first antibody is then reacted with a sample, and after washing the solid phase, 
the resultant is then reacted with a second antibody which reacts with the enzyme of 
5 the present invention by antigen-antibody reaction. After washing the solid phase, 
the second antibody bound to the solid phase is measured. By labeling the second 
antibody with an enzyme, fluorescent substance, radioactive substance, biotin or the 
like, measurement of the second antibody bound to the solid phase may be attained 
by measuring the label. The above-mentioned measurement is conducted for a 

1 0 plurality of standard samples each containing a known concentration of the enzyme, 
and the relationship between the concentrations of the enzyme in the standard 
samples and the measured amounts of the label is plotted to prepare a calibration 
curve. The enzyme in a test sample may be quantified by applying the measured 
amount to the calibration curve. It should be noted that the above-mentioned first 

1 5 antibody and the above-mentioned second antibody may be exchanged. In 

agglutination immunoassays, the antibody according to the present invention or an 
antigen-binding fragment thereof is immobilized on particles such as latex particles, 
and the particles are reacted with a sample, followed by measurement of the 
absorbance. The above-mentioned measurement is conducted for a plurality of 

2 0 standard samples each containing a known concentration of the enzyme, and the 

relationship between the concentrations of the enzyme in the standard samples and 
the measured absorbance is plotted to prepare a calibration curve. The enzyme in a 
test sample may be determined by applying the measured absorbance to the 
calibration curve. 

2 5 The reagents necessary for each type of immunoassay are also well-known in 

the art. Except for the antibody used, the immunoassay according to the present 
invention may be carried out using an ordinary kit for immunoassay. For example, 
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such an immunoassay kit may usually include buffer solution, solid phase, labeled 
second antibody and the like. 

As will be concretely described in the Examples below, it was confirmed that 
diagnoses of cancers and/or tumors can be attained by using the amount of 
5 expression of the enzyme of the present invention as an index. Thus, the present 
invention also provides a method for diagnosis of a cancer or tumor, comprising 
determining the amount of expression of the gene coding for the enzyme of the 
present invention, in (a) sample cell(s) separated from body. As will be concretely 
described in the Examples below, the tumors which can be detected by the diagnosis 

1 0 method according to the present invention are cancers or tumors for which cancers 
are strongly suspected. As the sample cells, cells of digestive organs are preferred, 
and cells from colon are especially preferred. By applying the diagnosis method to 
these cells, cancers or tumors of digestive organs, especially cancer and/or tumor of 
colon may be diagnosed. The expression amount of the gene may be measured by 

1 5 measuring the amount of the mRNA transcribed from the gene or the amount of the 
cDNA prepared by using the mRNA as a template, or by measuring the enzyme 
produced in the sample cells by an immunoassay using the antibody of the present 
invention. The measurement of the mRNA or cDNA may be carried out using the 
above-described nucleic acid for measurement according to the present invention by 

2 0 the method described above. 
Examples 

The present invention will now be described by way of Examples. However, 
the present invention is not restricted to the Examples. In the following description, 
the nucleic acid having the nucleotide sequence shown in SEQ ID NO:5, for example, 
25 may also be referred to as "SEQ ID NO:5 n for convenience. 

1 . Search of Gene Database and Determination of Nucleotide Sequence of 
[J3GnT-7 
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Using analogous genes which are known pl,3-N- 
acetylglucosaminyltransferase genes, pi,3-galactosyltransferase genes and pl,3-N- 
acetylgalactosaminyltransferase gene, search of analogous genes was carried out on a 
gene database. The used sequences were pl,3-JV-acetylglucosaminyltransferase 
5 genes with accession Nos. : AB049584, AB049585, AB049586 and AB045278; p 1 ,3- 
galactosyltransferase genes of accession Nos. AF1 17222, Y15060, Y15014, 
AB026730, AF145784 and AF145784; and pl,3-iV-acetylgalactosaminyltransferase 
gene with accession No. Y15062 (all of the accession Nos. are of GenBank). The 
search was carried out using a program tBlastn of BLAST, and all of the amino acid 
1 0 sequences corresponding to ORFs (Open Reading Frames) were included in the 
search. 

As a result, EST sequences with GenBank Accession Nos. AK000770 and a 
human genomic sequence ACO 1 7 1 04 were discovered. Thus, using ACO 1 7 1 04, a 
library was screened. 

1 5 The used sample was human antrum cDNA library prepared by a 

conventional method (Yuzuru Ikehara , Hisashi Narimatsu et al, Glycobiology vol. 9 
no. 1 1 pp. 1213-1224, 1999). The screening was carried out by a usual nucleic acid 
probe method using a radio isotope. The concrete procedures were as follows: 
First, using the X phage prepared from a human antrum cDNA library by a 

20 conventional method as templates, PCR was performed using as primers CB-635(5'- 
cagca gctgc tggcc tacga agac- 3') (nt6814-6837 in AC017104) and CB-638 (5'-gcaca 
tgccc agaaa gacgt cgtc-3') (nt722 1-7245). The amplified DNA fragment having a 
size of about 430 bp was labeled with 32 P-dCTP using Multiple DNA labeling 
system produced by AMERSHAM. 

2 5 Using this probe, single plaques which hybridized with this probe were 

picked up from the plaques of X phage formed on E. coli. Existence of the target 
DNA region was confirmed by PCR using the above-mentioned primers CB635 and 
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CB638. Since the phage obtained from the plaques, in which the insertion of the 
DNA fragment was confirmed was constructed by X ZAP II vector (STRATAGENE) 
(Yuzuru Ikehara , Hisashi Narimatsu et al, Glycobiology vol. 9 no. 11 pp. 1213-1224, 
1999), a cDNA clone inserted into pBluescript SK vector can be prepared (excision) 
5 by the method according to the manufacturer's instruction. The recombinant vector 
was prepared by this method, and a DNA was obtained from the obtained colony. 
The cDNA clone was then sequenced (SEQ ID NO:6). 

The SEQ ID NO:6 obtained by the above-described method corresponded to 
nt4828-7052 of ACO 1 7 1 04 and lacked the 3' region of ORF. Therefore, the 3' 

1 0 region was cloned after amplification thereof by PCR using the cDNA, and was 

ligated. That is, a primer CB-625 (5'-cgttc ctggg cctca gtttc ctag-3') (nt7638-7661) 
corresponding to a region downstream of the termination codon was designed based 
on the sequence expected from ACO 17104 resulted from the search by computer, and 
using this primer in combination with the above-described CB635, a DNA fragment 

1 5 was obtained from the above-described human antrum cDNA library. The obtained 
DNA fragment was sequenced by a conventional method to obtain SEQ ID NO: 7 
(nt6814-7661 in AC017104) (hereinafter referred to as "SEQ ID NO:3"). By 
combining this with SEQ ID NO:6, a theoretical ORF of 978 bp (nt6466-7452 in 
AC017104) was obtained, and a sequence of 328 amino acids was deduced from this 

2 0 ORF, which was named p3GnT-7 (SEQ ID NO:8). It is known that 

glycosyltransferases are, in general, type 2 enzymes having one transmembrane 
segment. However, no hydrophobic region was found in the N-terminal region of 
this ORF sequence. Since it has been reported that p 1 ,3-N- 
acetylglucosaminyltransferase activity is detected in human serum (Human Serum 

2 5 Contains TV-Acetyllactosamine: p 1 ,3-N-Acetylglucosaminyltransferase Activity. 

Hosomi, O., Takeya, A., and Kogure, T. J. Biochem.95, 1655-1659(1984)), the 
enzyme encoded by this ORF was a secretory type enzyme having no transmembrane 
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region. 

To show that the ORF having the sequence shown in SEQ ID NO: 8 and the 
amino acid sequence encoded thereby actually exist and function (i.e., expressed), 
existence of the mRNA was checked by RT-PCR and confirmation of the PCR 
5 product by a restriction enzyme, and by direct sequencing (usual method) of the PCR 
product was carried out. As a result, it was confirmed that the above-described 
theoretical ORF surely existed and actually functioned. 

As mentioned above, although it is known that glycosyltransferases are, in 
general, type 2 enzymes having one transmembrane segment, there is no hydrophobic 

1 0 region in the N-terminal region of the amino acid sequence shown in SEQ ID NO:8, 
so that the enzyme was thought to be different from the usual glycosyltransferases. 
Thus, whether a splicing variant having a hydrophobic region (transmembrane 
segment) in the N-terminal region exists or not was checked by analyzing the 
nucleotide sequence in the 5' region (i.e., the N-terminal region of the amino acid 

15 sequence). 

First, using Human stomach Marathon-Ready cDNA (CLONETECH), 5*- 
RACE (Rapid amplification of cDNA ends) was performed. More particularly, 
using the API primer included in Marathon cDNA (an adaptor API was attached to 
the both ends of the DNA fragment, and an adaptor AP2 was attached to the both 

2 0 inner ends thereof) and a primer p3GnT-7RACE-5 (5»-GACCG ACTTG ACAAC 
CACCA GCA-3') corresponding to the found sequence region, PCR was performed 
(94°C for 60 seconds, 5 cycles of 94°C for 30 seconds-72°C for 3 minutes, 5 cycles of 
94°C for 30 seconds-70°C for 3 minutes, and 25 cycles of 94°C-68°C for 3 minutes) 
was performed. The obtained DNA product was subjected to nested PCR (94°C for 

25 60 seconds, 5 cycles of 94°C for 30 seconds-72°C for 3 minutes, 5 cycles of 94°C for 
30 seconds-70°C for 3 minutes, and 15 cycles of 94°C-68°C for 3 minutes) using the 
AP2 primer included in Marathon cDNA and a primer p3GnT-7RACE-4 (5- 
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GTAGA CATCG CCCCT GCACT TCT-3'). The obtained product was cloned into 
pGEMeasy (CLONETECH) and sequenced. As a result, the sequence upstream of 
the initiation codon of the earlier discovered SEQ ID NO: 6 was obtained, and a 
transmembrane region was observed when deduced into amino acid sequence. 
5 However, although the 5' region of the nucleotide sequence in the vicinity of the 

transmembrane region was analyzed, the initiation codon of the ORF was not found. 

Thus, using GeneScan, HMMgene and the like which were softwares for 
analyzing gene regions, the translation region of the human genomic sequence 
AC017104 containing p3GnT-7 was analyzed. As a result, a first exon of 1 1 bases 

10 (about 3 amino acid) (nt4331-4341 of AC017104) containing the initiation codon 
was expected. Thus, using a primer corresponding to an upstream region of the 
initiation codon, PCR was performed in order to determine whether the expected 
region existed as a transcript. 

More particularly, PCR (30 cycles of 95°C for 30 seconds, 60°C for 30 

1 5 seconds, 72°C for 60 seconds) was performed using as primers p3GnT-7RACE-8 (5'- 
GCCCA GAGCT GCGAG CCGCT-3') (nt4278-4300 in AC017104) and CB-638 (5'- 
GCACA TGCCC AGAAA GACGT CG-3')((nt7224-7245 in AC017104), as a 
template Human leukocyte Marathon-Ready cDNA, and LA-Taq (TaKaRa). As a 
result, an amplification product having a size of 1046 bases was obtained. This 

2 0 PCR product was purified and sequenced. It was proved, as expected from the 
above-described analysis of the translation region, the 3 '-side (nt4341) in the first 
exon was ligated to nt6258 in a downstream region. 

By combining SEQ ID NOs: 6 and 7 and this result, the nucleotide sequence 
having 1206 bases shown in SEQ ID NO:5 and the amino acid sequence having 401 

2 5 amino acids shown in SEQ ID NO:9 were obtained. The SEQ ID NO:5 was one in 
which the upstream regions of 219 bases (73 amino acids) (nt4331-4341 and nt6258- 
6465 in AC017104) were ligated to SEQ ID NO:8 (combination of SEQ ID NOs:6 
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and 7), and it was thought that nt4342-6257 was spliced. Since SEQ ID NO:5 
contains a transmembrane segment (nt6265-6322 in AC017104), SEQ ID NO:5 and 
SEQ ID NO: 8 were thought to be the transmembrane type and secretory type having 
the same activity, respectively. 
5 2. Insertion of p3GnT-7 into Expression Vector 

To examine the activity of p3GnT-7, p3GnT-7 was expressed in insect cells. 
Although it is thought that the activity may be confirmed enough by expressing the 
active region from the 1 1 9th amino acid to the C-terminal of SEQ ID NO:9, which 
region is relatively well conserved in the other genes of the same family, the active 
1 0 region from the 75th amino acid to the C-terminal of (53GnT-7 (SEQ ID NO:9) was 
expressed. 

The gene was incorporated into pFastBac of Gateway system from 
INVITROGEN, and then a Bacmid by Bac-to-Bac system from INVITROGEN was 
prepared. 

15 (D Preparation of Entry Clone 

PCR was performed using p3GnT-7S primer (5*-GGGGA CAAGT TTGTA 
CAAAA AAGCA GGCTT Cgcct ctcag gggcc ccagg cct-3 f ) and p3GnT-7A primer 
(5 -GGGGA CCACT TTGTA CAAGA AAGCT GGGTC catgg gggct cagga gcaag 
tgcc-3') (the nucleotides shown in capital letters were the added sequence attL for 

2 0 GATEWAY hereinbelow described), and as a template the DNA of p3GnT-7 clone 
(the clone containing the theoretical ORF sequence) generated from the cDNA clone 
obtained by the screening and the DNA fragment obtained by PCR, to obtain an 
amplification product. 

This product was incorporated into pDONR201 by BP clonase reaction to 

2 5 prepare an "entry clone". The reaction was carried by incubating a mixture of 5 jil 
of the desired DNA fragment, 1 |il (150 ng) of pDONR201, 2 ^il of reaction buffer 
and 2 \il of BP clonase mix at 25°C for 1 hour. After adding 1 |il of Proteinase K, 
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the reaction mixture was left to stand at 37°C for 10 minutes, thereby terminating the 
reaction. 

Then the whole mixture (11 jal) was mixed with 100 \xl of competent cells (E. 
coli DH5a), and after heat shock, the mixture was plated on an LB plate containing 
kanamycin. On the next day, colonies were collected, and existence of the desired 
DNA was directly confirmed by PCR. For double check, the nucleotide sequence of 
the DNA was confirmed, and vector (pDONR-p3Gn-T7) was extracted and purified. 
(D Preparation of Expression Clone 

The above-described entry clone has attL at the both ends of the inserted 
region, the attL being a recombination site used when X phage is cut out from E. coli. 
By mixing the entry clone with LR clonase (a mixture of recombination enzymes Int, 
IHF and Xis of X phage) and a destination vector, the inserted region is transferred to 
the destination vector so that an expression clone is prepared. These operations will 
now be described in detail. 

Firstly, a mixture of 1 ^il of the entry clone, 0.5 jj.1 (75 ng) of pFBIF, 2 |al of 
LR reaction buffer, 4.5 jixl of TE and 2 [il of LR clonase mix were allowed to react at 
25°C for 1 hour, and then 1 \il of Proteinase K was added, followed by incubation at 
37°C for 10 minutes, thereby terminating the reaction (by this recombination reaction, 
pFBIF-p3Gn-T7 is generated). The pFBIF was one obtained by inserting IgK signal 
sequence (MHFQVQIFSFLLISASVIMSRG) and FLAG peptide (DYKDDDDK) for 
purification. The IgK signal sequence was inserted in order to change the expressed 
protein to a secretory protein, and the FLAG peptide was inserted for purification. 
The DNA fragment obtained by PCR using as a template OT3 (5'-gatca tgcat tttca 
agtgc agatt ttcag cttcc tgcta atcag tgcct cagtc ataat gtcac gtgga gatta caagg acgac gatga 
caag-3'), and using primers OT20 (5 f - cgggatccat gcattttcaa gtgcag-3') and OT21 (5- 
ggaat tcttgt catcg tcgtc cttg-3') was inserted using Bam HI and Eco RI. Further, to 
insert the Gateway sequence, Conversion cassette was inserted using Gateway Vector 
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Conversion System (INVITROGEN). 

Then the whole mixture (1 1 was mixed with 100 |al of competent cells (E. 
coli DH5a), and after heat shock, the mixture was plated on an LB plate containing 
ampicillin. On the next day, colonies were collected, and existence of the desired 
5 DNA was directly confirmed by PCR, followed by extraction and purification of the 
vector (pFBIF-p3Gn-T7). 

(3) Preparation of Bacmid by Bac-to-Bac System 

Using Bac-to-Bac system (INVITROGEN), recombination was carried out 
between the above-described pFBIF- and pFastBac, and G10 and other sequences 

10 were inserted into a Bacmid which was able to replicate in insect cells. With this 
system, the desired gene is incorporated into the Bacmid by the recombinant protein 
produced by a helper plasmid, only by incorporating pFastBac into which the desired 
gene was inserted, using the recombination site of Tn7 into an E. coli (DH10BAC) 
containing the Bacmid. The Bacmid contains lacZ gene, so that classical selection 

1 5 based on the color, that is, blue (no insertion) or white (with insertion), of the colony 
can be attained. 

That is, the above-described purified vector (pFBIH-p3GnT-7) was mixed 
with 50 (il of competent cells (E. coli DH10BAC), and after heat shock, the mixture 
was plated on an LB plate containing kanamycin, gentamycin, tetracycline, Bluo-gal 
2 0 and IPTG. On the next day, white single colony was further cultured and Bacmid 
was collected. 

3. Introduction of Bacmid into Insect Cells 

After confirming that the desired sequence was inserted into the Bacmid 
obtained from the white colony, the Bacmid was introduced into insect cells Sf2 1 
2 5 (commercially available from INVITROGEN). That is, to a 35 mm Petri dish, Sf21 
cells in an amount of 9 x 10 5 cells/2 ml (Sf-900SFM (INVITROGEN) containing an 
antibiotic) were added, and the cells were cultured at 27°C for 1 hour to adhere the 
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cells. (Solution A): To 5 [il of the purified Bacmid DNA, 100 \xl of Sf-900SFM 
(INVITROGEN) not containing an antibiotic was added. (Solution B): To 6 jil of 
CellFECTIN Reagent (INVITROGEN), 100 ^1 of Sf-900SFM (INVITROGEN) not 
containing an antibiotic was added. Solution A and Solution B were then gently 
5 mixed and the mixture was incubated for 15 to 45 minutes at room temperature. 

After confirming that the cells adhered, the culture medium was aspirated and 2 ml of 
Sf-900SFM (INVITROGEN) not containing an antibiotic was added. To a solution 
(lipid-DNA complexes) prepared by mixing Solution A and Solution B, 800 |lx1 of 
Sf900II not containing an antibiotic was added and the resultant was gently mixed. 

1 0 The culture medium was aspirated, and diluted lipid-DNA complexes solution was 
added to the cells, followed by incubating the cells at 27°C for 5 hours. Thereafter, 
transfection mixture was removed and 2 ml of culture medium Sf-900SFM 
(INVITROGEN) containing an antibiotic was added, followed by incubating the 
resultant at 27°C for 72 hours. Seventy two hours after the transfection, the cells 

1 5 were peeled off by pipetting, and the cells and the culture medium were collected. 

The cells and the culture medium were centrifuged at 3000 rpm for 10 minutes, and 
the obtained supernatant was stored in a separate tube (this supernatant is the primary 
virus solution). 

To a T75 culture flask, Sf21 cells in an amount of 1 x 10 7 cells/20 ml of Sf- 
2 0 900SFM (INVITROGEN) (containing an antibiotic) were placed, and the resultant 
was incubated at 27°C for 1 hour. After the cells adhered, 800 nl of the primary 
virus was added and the resultant was cultured at 27°C for 48 hours. Forty eight 
hours later, the cells were peeled off by pipetting and the cells and the culture 
medium were collected. The cells and the culture medium were centrifuged at 3000 
2 5 rpm for 1 0 minutes, and the obtained supernatant was stored in a separate tube (this 
supernatant was used as the secondary virus solution). 

Further, to a T75 culture flask, Sf21 cells in an amount of 1 x 10 7 cells/20 ml 
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of Sf-900SFM (INVITROGEN) (containing an antibiotic) were placed, and the 
resultant was incubated at 27°C for 1 hour. After the cells adhered, 1000 jal of the 
secondary virus solution was added and the resultant was cultured at 27°C for 72 to 
96 hours. After the culturing, the cells were peeled off by pipetting and the cells 
and the culture medium were collected. The cells and the culture medium were 
centrifuged at 3000 rpm for 10 minutes, and the obtained supernatant was stored in a 
separate tube (this supernatant was used as the tertiary virus solution). Further, to a 
100 ml spinner flask, 100 ml of Sf21 cells at a population of 6 x 10 5 cells/ml was 
placed, and 1 ml of the tertiary virus solution was added, followed by culturing the 
cells at 27°C for about 96 hours. After the culturing, the cells and the culture 
medium were collected. The cells and the culture medium were centrifuged at 3000 
rpm for 10 minutes, and the obtained supernatant was stored in a separate tube (this 
supernatant was used as the quaternary virus solution). 

The primary to tertiary cell pellets were sonicated (sonication buffer: 20mM 
HEPES pH7.5, 2 % Triton X-100 (trademark)) and the crude cell extract was 20-fold 
diluted with H 2 0. The resultant was subjected to SDS-PAGE and then to Western 
blotting using anti-FLAG M2-peroxidase (A-8592, SIGMA) in order to confirm the 
expression of p3Gn-T7 protein. As a result, a plurality of broad bands (thought to 
be due to differences in post-translational modifications by sugar chains or the like) 
centering at the position of about 38-40 kDa were detected, so that the expression 
was confirmed. 

4. Resin Purification of p3Gn-T7 

To 10 ml of the supernatant of FLAG-p3Gn-T7 of the quaternary infection, 
NaN 3 (0.05 %), NaCl (150 mM), CaCl 2 (2 mM), and anti-Mi resin (SIGMA) (50 (al) 
were added and the resulting mixture was stirred overnight at 4°C. On the next day, 
the mixture was centrifuged (3000 rpm for 5 minutes, at 4°C) and the pellet was 
collected. To the pellet, 900 \il of 2 mM CaCl 2 TBS was added and the resultant 
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was centrifuged again (2000 rpm for 5 minutes, at 4°C), and the pellet was suspended 
in 200 \xl of 1 mM CaC^ TBS to obtain a sample (p3GnT-7 enzyme solution) for the 
measurement of activity. 

5 . Search of Acceptor Substrate of 03Gn-T7 
5 As a result of molecular evolutionary analysis comparing p3Gn-T7 with pi,3- 

N-acetylglucosaminyltransferases and pi,3-galactosyltransferases, p3Gn-T7 was 
classified into pi,3-Af-acetylglucosaminyltransferases. Thus, firstly, analysis was 
performed using UDP-GlcNAc as the donor substrate. 

Using the following reaction systems, the acceptor substrate was searched. 

10 As the "acceptor substrate" in the reaction solution described below, each of the 

following was used and whether each of them functioned as the acceptor or not was 
investigated: pNp-a-Glc, j^Np-P-Glc, /?Np-a-GlcNAc, /?Np-P-GlcNAc, pNp-a-Gal, 
^Np-p-Gal, ^Np-a-GalNAc, Bz-a-GalNAc, /?Np-a-Xyl, pNp-p-Xyl, pNp-a-Fuc, Bz- 
a-Man, Bz-a-ManNAc, LacCer, GalCer typel and Bz-p-lactoside (all of them are 

1 5 from SIGMA) and Galp 1 ^GIcNAc-a-pNp (TRONTO RESEARCH CHEMICAL). 

The reaction solution (the numbers in the parentheses indicate the final 
concentrations) contained acceptor substrate (10 nmol), sodium cacodylate buffer 
(pH7.2) (50mM), Triton CF-54 (trademark) (0.4%), MnCl 2 (10 mM), UDP-GlcNAC 
(480 |iM) and UDP-[ 14 C]GlcNAC (175 nCi) and CDP-colline (5 mM), to which 10 

2 0 |il of the p3Gn-T7 enzyme solution and H 2 0 were added to attain a final volume of 
25 |il. 

The reaction mixture was allowed to react at 37°C for 5 hours, and after 
completion of the reaction, 200 [il of 0.1 M KC1 was added, followed by light 
centrifugation and collection of the supernatant. The supernatant was passed 
2 5 through Sep-Pak plus CI 8 Cartridge (WATERS) equilibrated by washing once with 
10 ml of methanol and then twice with 10 ml of H20 5 so as to adsorb the substrate 
and the product in the supernatant on the cartridge. After washing the cartridge 
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twice with 10 ml of H 2 0, the adsorbed substrate and the product were eluted with 5 
ml of methanol. The eluted solution was evaporated to dryness by blowing nitrogen 
gas while heating the solution with a heat block at 40°C. To the resultant, 20 fil of 
methanol was added, and the resulting mixture was plotted on a TLC plate (HPTLC 
5 plate Silica gel 60: MERCK), and developed using a developing solvent having the 
composition of chloroform:methanol:water (containing 0.2% CaC^) = 65:35:8. 
After developing the mixture up to 5 mm from the top end of the TLC plate, the plate 
was dried and the intensity of the radioactivity taken in the product was measured 
using Bio Image Analyzer FLA3000 (FUJI PHOTO FILM). 

10 As a result, it was proved that p3GnT-7 is a p 1 ,3-JV- 

acetylglucosaminyltrasferase having an activity to transfer GlcNAc to Bz-P-lactoside 
and Gaipi-4Glc(NAc)-a-/?Np, that is, an enzyme which transfers GlcNAc to the 
galactose at the non-reducing terminal of Galpl-4Glc(NAc)-R. 
6. Measurement of p3GlcNAcT activity to N-glycan 

15 As the enzyme source, the expressed and purified recombinant enzyme (to 

which the FLAG sequence is fused) was used as in the case mentioned above. As 
the acceptor substrates, commercially available PA-bound sugar chain substrates 
(produced by TAKARA BIO) shown in Table 1 were used. The reaction was 
carried out in a mixture containing 14 mM sodium cacodylate buffer (pH7.4), 0.4% 

2 0 Triton CF-54, 10 mM MnCl 2 , 50 mM UDP-GlcNAc (donor substrate), 20 pmol of 

the acceptor substrate and 100 ng of the enzyme protein solution at 37°C for 16 hours. 
The reaction was terminated at 95°C for 3 minutes, and 80 ^1 of water was added. 
The resulting mixture was passed through Ultra-free MC column (WATERS), and 45 
|il aliquot of the passed solution was subjected to HPLC. The conditions of the 

2 5 HPLC were as described below. The conversion enzyme activity (%) was 

determined using a solution which did not contain UDP-GlcNAc (donor substrate) as 
a control. The results are shown in Table 1 below. 
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(HPLC Conditions) 

Buffer La : 100 mM acetic acid/triethylamine, pH 4.0 

Buffer Lb : 100 mM acetic acid/triethylamine, pH 4.0 (containing 0.5% 1-butanol) 
gradient : 5-55% : Buf. Lb (0-60 min.), 
flow rate : 1 .0 ml/min. 

column : PalPak Type R (TaKaRa Cat. No. CA8000) 
column oven temp : 40°C 

HPLC System: Shimadzu LC-10AD vp.CTO-lOAC vp , DGU-14A, cell temp 
controller 

Detector: Fluorescence: RF-10AXL.UV: SPD-lOAvp 
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Table 1 



Acceptor Substrate 


Conversion 
Activity(%) 


°Man/J 1-4GteNAc/3 t*4GtcNAcPA 
Gal » 1 .4GlcNAc0 1-2Man a \^ 


18.3 


Gal/31-4GlcNAc01 -2Mana1 v 

Gal /J 1 -4GlcNAc/9 1x ^an* 1-4GlcNAc/M-4GlcNAo-PA 

5Mano 1 
Galj9MGlcNAo#1^ 


26.0 j 


Gar/3f-4GlcNAc01\ R 

GaJjSl^GloNActfl^ x s 

Swan $ 1-4GteNAc£ 1-4GtoNAoPA 

ihAZTUXi 


20.3 


Gal ^ 1 -46lcNAc fi 1-2Man« K_ Ric a 1v 

®Man£ 1-4GlcNAc0 f-4GJcNAc-PA 

Gal /J 1 -4GteNAc J9? 1-2Man<* 1^ 


20.6 


Gal£ t-4GlcNAc01 - 2Mana1 ^ Rica Is^ j 
Gal/M-4GteNAc0i x jMan«l^!olWo«MGlcN^PA 

SManal 
Gali3f-4GloNActft^ 


17.3 


Gd/Jf*4G1oNAop1 x 

GaJjSWGIcNActfl"' 2 ^ 6 ^6 

!?Man0 1-4GIcNAc 0 1-4GlcNAc-PA 

GatjSf^GlcNAa/Sl*^ 


18,1 


Manofl^. 

JjMan 0 1«4GlcNAc0 1-4GlcNAo«PA 
Manal^ 


0.0 


Gal 0 1-4GISNAC jS 1-3Gal 0 1«4G!o-PA 
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7. Measurement of Expression of Enzyme by Flow Cytometry 

The p3GnT-7(G10) gene was incorporated into pDEST12.2 vector 
(INVITROGEN) to prepare pDESTl 2.2-G1 0 vector DNA. More particularly, this 
was carried out as follows: Using primers described below containing the sequence 
5 of the Gateway system of INVITROGEN, a cDNA from Colo205 cells (colon cancer 
cells) was amplified by PCR, and the amplification product was first incorporated 
into the pDONR vector by BP reaction. After confirming the DNA sequence by 
sequencing the vector, the insert was transferred from the pDONR vector to 
pDEST12.2 vector by LR reaction. These operations were carried out using the 
1 0 vectors and reagents contained in the kit of INVITROGEN in accordance with the 
instructions included in the commercial product. 
G10/ORF-F1 Primer 

ggggacaagtttgtacaaaaaagcaggcttctggcgcccagagctgcgagccgct 
(In this, ggggacaagtttgtacaaaaaagcaggcttc is a sequence in the vector) 
15 G10/ORF-R1 Primer 

ggggaccactttgtacaagaaagctgggtccatgggggctcaggagcaagtgcc 

(In this, the cDNA sequence of b3GnT7 gene is from catgggggctcaggagcaagtgcc) 
By the above-described procedures, a recombinant vector was obtained in which a 
DNA fragment containing the cDNA shown in SEQ ID NO: 5 to which the region 

2 0 other than the cDNA sequence in the above-described primers was attached to the 5'- 
and 3 '-ends thereof was inserted. This recombinant vector was introduced into 
HCT15 cell line and LSC cell line (both are colon cancer cell lines) by a conventional 
method. As a control, the pDEST12.2 vector DNA in which the gene was not 
incorporated was introduced into the cell lines in the same manner (Mock cells). 

2 5 After carrying out the selection by 0.8 mg/ml of G4 1 8 (INVITROGEN) for one 
month, the cells were harvested. The harvested cells were washed twice with 
l%BSA/0.1%NaN3/PBS(-). The cell population was adjusted to 1 x 10 7 cells/ml, 
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and 100 jal (1 x 10 cells) aliquot thereof was used for one sample. After 
centrifugation, the supernatant was removed and the resultant was diluted to a 
concentration of 10 |ig/ml. To the resultant, 100 jil each of the FITC-labeled lectins 
described below were added, and the cells were suspended. After allowing the 
reaction at 4°C in the dark (refrigerator) for 30 minutes, 100 ^1 of 
1 %BS A/0. 1 %NaN 3 /PBS was added to each well to carry out washing. The 
resultant was centrifuged at 1000 rpm for 5 minutes, and the supernatant was 
removed. The washing was repeated once more. The resulting cells were 
suspended in 1 ml of 0.5% paraformaldehyde/PBS to fix the cells, and analyzed by 
flow cytometry FACSCalibur (BECTON DICKINSON) after passing the cells 
through a nylon mesh. The results are shown in Figs. 1-3. 

The used lectins were Lycopersicon esculentum (LEA) and Triticum vulgare 
(WGA), both of which recognize the repetition of iV-acetyl lactosamine structure, and 
N-acetyl glucosamine structure, and labeled with FITC (purchased from HONEN, 
SEIKAGAKU CORPORATION, EY LABORATORIES and so on). 

Fig. 1 shows the results of the flow cytometry showing the binding property 
between the HCT15 colon cancer cell line and the LEA lectin. Fig. 2 shows the 
results of the flow cytometry showing the binding property between the LSC colon 
cancer cell line transformed with the recombinant vector containing the gene of the 
present invention or the vector not containing the gene of the present invention and 
LEA lectin. Fig. 3 shows the results of the flow cytometry showing the binding 
property between the HCT15 colon cancer cell line and the WGA lectin. In each of 
the drawings, the bold line shows the results of the cells transformed with the 
recombinant vector containing p3GnT-7 gene, and the thin line shows the results of 
the cells (Mock cells) transformed with the vector not containing p3GnT-7 gene. 

As shown in Figs. 1-3, in all of the cases, the fluorescence intensity was 
shifted, which indicates that the N-acetyl lactosamine-containing structure was 
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increased in the cells into which the DNA of pDEST12.2-G10 containing p3GnT- 
7(G10) gene was incorporated. 

8 . Analysis of Tissue-specific Expression of P 3 GnT-7 

The expression of the gene in tissues and in cell lines was examined by Real 
5 Time PCR method (Gibson, U. E., Heid, C. A., and Williams, P. M. (1996) Genome 
Res 6, 995-1001). Human tissue cDNAs used as materials were the Marathon 
cDNAs. From the various cell lines, total RNAs were extracted by a conventional 
method and the cDNAs were synthesized. For obtaining the calibration curve of 
p3GnT-7, a plasmid containing p3GnT-7 gene inserted in pDONR™201 vector 

10 DNA was used. As a control for the endogenous expression, constantly expressed 
human glyceraldehyde-3 -phosphate dehydrogenase (GAPDH)) was used. For 
obtaining the calibration curve of GAPDH, a plasmid containing the GAPDH gene in 
pCR2.1 (INVITROGEN) was used. As the primer set and probe for p3GnT-7, the 
following were used: RT-p3GnT-7-F2; 5'-TTCCTCAAGTGGCTGGACATC-3', 

1 5 RT-p3GnT-7-R2;5'-GCCGGTCAGCCAGAAATTC-3', probe;5'- Fam 

ACTGCCCCCACGTCCCCTTCA -MGB-3*. As the primer set and probe .for 
GAPDH, a kit (Pre-Developed TaqMan® Assay Reagents Endogenous Human 
GAPDH (APPLIED BIOS YSTEMS) was used. The PCR was performed using 
TaqMan Universal PCR Master Mix (APPLIED BIOS YSTEMS) under the 

2 0 conditions of 50°C for 2 minutes, then at 95°C for 10 minutes, and repeating 50 
cycles of 95°C for 15 seconds-60°C for 1 minute. The quantitation of the PCR 
product was carried out using ABI PRIAM7700 Sequence Detection System 
(APPLIED BIOSYSTEMS). The expression amount of Gl 1 was normalized by 
dividing the amount by the amount of the transcription product of the constantly 

25 expressed GAPDH. The results for the human tissues are summarized in Table 2, 
and the results for the cell lines are summarized in Table 3. 



32 



Table 2 
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L/Idlll 


0 01 04 S 




0 04S99 


cereDeiiurn 


0 09^4^ 


lCldl Uldlll 


0 090^0 


uuiic iiiarruw 


0 01 469 


liiyroiu 


0 04084 


^"rT* rt"**t lie 

inymus 


0 01 974 
U.U 1 Z /*t 


spieen 


0 1 01 OR 
U.1U1 vo 


icUKOcyxe 


0 075*76 


heart 


0 00CK6 
U.UW7 JO 


SKeietai muscie 


O 00O71 
U.UUU / 1 


lung 


O 1 9 1 /1A 


liver 


O 009QO 


esophagus 


U.UU0LO 


stomacn 


u.zoyzz 


small intestine 


O OQ'*'*'* 

U.I/7JJJ 


colon 


O 07A10 


pallL/ICdb 


0 97^1 7 


kidney 


0.01161 


adrenal 


0.15069 


mammary gland 


0.02560 


uterus 


0.07747 


placenta 


0.18763 


ovary 


0.11465 


testis 


0.05323 



The tissues in which |33GnT-7 was highly expressed were pancreas, stomach, 
placenta and adrenal, and the tissues in which p3GnT-7 was moderately expressed 
were colon, leukocyte, lung, ovary, small intestine, spleen, testis, uterus and cerebral 
cortex. In the tissues other than these tissues, the expression amount was relatively 
low. 
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Table 3 



Cell (origin) 


P 3 GnT-7/G APDH 


GOTO (neuroblastoma) 


0.00012 


SCCH-26 (neuroblastoma) 


0.00137 


T98G (glioblastoma) 


0.00032 


U251 (glioblastoma) 


0.00023 


Leukemia (premyeloblastic leukemia) 


0.35660 


Melanoma (skin) 


0.01255 


HL-60 (premyeloblastic leukemia) 


0.17663 


K562 (leukemia) 


0.00038 


U937 (monocyte) 


0.01617 


Daudi (B cell (Burkitt's)) 


0.00437 


PC-1 (lung) 


0.00000 


EBC-1 (lung) 


0.00121 


PC-7 (lung) 


0.00017 


HepG2 (liver) 


0.01199 


A431 (esophagus) 


0.01031 


MKN45 (stomach) 


0.00027 


KATOIII (stomach) 


0.03964 


HSC43 (stomach) 


0.00031 


Colo205 (colon) 


0.00278 


HCT15 (colon) 


0.00193 


LSC (colon) 


0.00003 


LSB (colon) 


0.00128 


SW480 (colon) 


0.00045 


SW1 116 (colon) 


0.13076 


Capan-2 (pancreas) 


0.03664 


PA-1 (uterus) 


0.00290 



Expression of p3GnT-7 in cell lines was lower than that in normal tissues. 
In HL60 cells originated from premyeloblastic leukemia and in SW1 116 cells 
originated from colon, the expression level was high. 

It was easily thought that the expression amount of p3GnT-7 is changed when 
the degree of differentiation is changed by cancerization or the like, so that there is a 
possibility that measurement of the expression amount of p3GnT-7 may be used for 
diagnoses of diseases. Further, as described above, there is a possibility that there 
are two initiation sites in p3GnT-7, so that there is a possibility that by measuring the 
change of the splicing variants, the state of differentiation and pathological change of 
the cells may be measured. 
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9. Expression of (33GnT-7 Gene in Normal Tissues and Cancer Tissues of Colon 
Cancer Patients 

The expression amounts of p3GnT-7 in normal (N) tissues and cancer (T) 

tissues of actual colon cancer (DK) patients were measured by the method described 
in f, 8. Analysis of Tissue-specific Expression of p3GnT-7". The results are 
shown in Fig. 4. From these results, in samples except for DK3, that is, in samples 
of DK10, DK15, DK19, DK22 and DK23, the tendency that expression of p3GnT-7 
in cancer tissue is smaller than in the normal tissue was observed. 



